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Abstract 

In this paper, we analyze the capacity of multiple-input multiple-output (MIMO) Rayleigh-fading 
channels in the presence of spatial fading correlation at both the transmitter and the receiver, assuming 
that the channel is unknown at the transmitter and perfectly known at the receiver. We first derive the 
determinant representation for the exact characteristic function of the capacity, which is then used to 
determine the trace representations for the mean, variance, skewness, kurtosis, and other higher-order 
statistics (HOS). These results allow us to exactly evaluate two relevant information-theoretic capacity 
measures — ergodic capacity and outage capacity — and the HOS of the capacity for such a MIMO 
channel. The analytical framework presented in the paper is valid for arbitrary numbers of antennas 
and generalizes the previously known results for independent and identically distributed or one-sided 
correlated MIMO channels to the case when fading correlation exists on both sides. We verify our 
analytical results by comparing them with Monte Carlo simulations for a correlation model based on 
realistic channel measurements as well as a classical exponential correlation model. 

Index Terms 

Channel capacity, higher-order statistics (HOS), multiple-input multiple-output (MIMO) system, 
Rayleigh fading, spatial fading correlation. 

I. Introduction 

Multiple-input multiple-output (MIMO) communication systems using multiple transmit and 
receive antennas promise high spectral efficiency and link reliability for wireless communications 
[l]-[3]. Although the linear growth of capacity with the number of antennas indicates the 
potential of MIMO systems, the true benefits of the use of multiple antennas may be limited 
by spatial fading correlation due to closely-spaced antenna configurations and poor scattering 
environments in realistic wireless channels [4], [5]. 

Since the pioneering work of [l]-[3] in the area of multiple- antenna communications pre- 
dicted remarkable spectral efficiency of MIMO wireless systems in independent and identically 
distributed (i.i.d.) Rayleigh fading, much subsequent work has concentrated on characterizing 
MIMO capacity under correlated fading [4]-[14]. However, the exact analytical results for 
the capacity such as ergodic (or mean) capacity, capacity variance, and outage capacity (i.e., 
capacity versus outage probability) 1 have been known for only a few special cases, largely due 

'in general, the capacity distribution is required to determine the outage capacity [2], [3], [15]. 



2 



IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. XX, NO. X, MONTH 2005 



to mathematical intractability (see, e.g., [3], [6], [16] for i.i.d. flat Rayleigh fading and [7]-[9] for 
a one-sided correlated MIMO channel). For a more general case of correlated fading at both the 
transmitter and the receiver, which we will refer to as doubly correlated MEMO channels in the 
paper, some limited results are available: the capacity distribution for a small number of antennas 
(i.e., min{?2 T ,72 R } < 3 where n T and n R are the numbers of transmit and receive antennas, 
respectively) [10], upper and lower bounds on the ergodic capacity [6], [12], capacity statistics 
for the case with a large number of antennas [13], and the asymptotic mean and variance of the 
capacity in the limit as the number of antennas tends to infinity [5], [14]. The temporal behavior 
of the capacity was analyzed in [11] in terms of level crossing rates and average fade durations. 

In this paper, we focus on deriving the exact analytical expressions for capacity statistics of 
doubly correlated MIMO Rayleigh-fading channels using the methodology developed in [6] and 
[7], assuming perfect channel knowledge at the receiver and no knowledge at the transmitter 
with the average input-power constraint. The principal contributions of this paper are as follows. 

• We derive a determinant representation for the characteristic function (CF) of MIMO 
capacity, which generalizes the previous results for i.i.d. and one-sided correlated channels 
[7]-[9] to the doubly correlated case. 

• We derive trace representations for the mean, variance, and higher-order statistics (HOS) 
(e.g., cumulants, skewness, and kurtosis) of the capacity using the determinant representation 
of the CF and the relationship between polymatrices and dimatrices. 2 

• We characterize the effect of fading correlation on the capacity statistics at high signal- 
to-noise ratio (SNR). We show that at high SNR, the variance, skewness, kurtosis, and 
other HOS of the capacity depend only on correlation at the side with the larger number of 
antennas. Moreover, when n T = n R , these statistics are not affected by fading correlation 
at any side. 

To verify our analytical results, we also compare them with Monte Carlo simulations for doubly 
correlated MIMO channels using a correlation model based on physical measurements [17], [18] 
as well as a classical exponential correlation model. It should be noted that alternative derivation 
of the moment generating function (MGF) of the capacity for doubly correlated MIMO channels 
can also be found in [19]. In this study, the MGF was obtained indirectly from the case of a 

2 The definitions of the polymatrix and the dimatrix will be introduced in Section ITTT1 
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square channel matrix (n T = n R ) using the limiting approach of [20] and then, the first moment 
(ergodic capacity) was deduced from it in terms of a sum of min {tit, Wr,} determinants. 

The remainder of the paper is organized as follows. A brief overview of the distributions 
of complex random matrices required for our analysis, channel model, and associated channel 
capacity are presented in Section HH The CF of the capacity is derived and the capacity statistics 
are analyzed for doubly correlated MIMO channels in Section |in| The effect of fading corre- 
lation on the capacity statistics is investigated at high SNR in Section |IVJ In Section |Vj some 
numerical and simulation results are provided to illustrate our analytical results. Finally, Section 
IVj concludes the paper. 

We shall use the following notation throughout the paper. N and C denote the natural numbers 
and the field of complex numbers, respectively. The superscript f denotes the transpose conjugate. 
I n and tr (A) represent the n x n identity matrix and the trace operator of a square matrix A, 
respectively. By A > 0, we denote that A is positive definite. For a matrix A (£) = [ay {t)\ 
where ay (i) are differentiable functions of t, the nth derivative of A (t) with respect to t is 
denoted by A< n > (t) = [d n a hj (t) /dt n ]. 

II. Preliminaries: Definitions and Models 

In this section, we give a brief overview of the distribution theory of complex random matrices 
(that serves as a central mathematical tool to analyze MIMO communication systems), channel 
model, and associated channel capacity. 

A. Distributions of Complex Random Matrices 

Let us denote a complex Gaussian matrix X G C mxn with the probability density function 
(PDF) [6, eq. (1)] 

Px (X) = 7r-™Met (Epdet (*)~™ ^H^Cx-m^cx-m)^ (1) 

by X ~ AL„ (M, S, *) where £ G C mxm > and * G C nxn > are Hermitian. If 
X ~ M m ,n (0, S, I n ), m < n, and Y = XX^, then Y has a complex (central) Wishart density 
W m (n, S) given by [6, eq. (3)]. 

Definition 1 (Quadratic Form in the Complex Gaussian Matrix [6]): Let 

X ~ Af m>n (0, £, , m < n. 
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A positive-definite quadratic form Y in X associated with a Hermitian matrix A G C" xrt > 0, 
denoted by Y ~ Q m , n (A, S, is then defined as Y = XAX f . 

The PDF of Y ~ Q m ,„ (A, E, is given by [21, eq. (57)] and can be expressed in an 
equivalent form 

p Y (Y) = — !— det (£p det (A*)" m det (Y) n ~ m F W (-E^Y, * 1 A" 1 ) , Y > (2) 

r m (n) 

where f m (a) = vr" 1 ^™" 1 ^ 2 n^t) 1 T (a — £), Re (a) > m — 1, is the complex multivariate gamma 
function, T (■) is the gamma function, and (■) is the hypergeometric function of two 

Hermitian matrices, defined by [21, eq. (51)]. Note that the density © is a counterpart of 
the real case in [22, eq. (7.2.5)] and if A* = I n , it reduces to the complex Wishart density 

VV w (n,£). 

B. Channel Model and Capacity Random Variable 

We consider a point-to-point frequency-flat fading MIMO link with n T transmit and n R receive 
antennas. Let x 6 C™ T be a transmitted signal vector with input co variance Q = Ejxx^} 
satisfying the power constraint tr (Q) < V, then the received signal is given by 

y = Hx + n (3) 

where H e C" RX " T is the random channel matrix whose (i,j)th entries Hy, i = 1,2, . . . 
j = 1,2, ...,nx, are complex propagation coefficients between the jth transmit antenna and 
the ith receive antenna with E{|f/y| 2 } = 1, and n is the complex riR-dimensional zero-mean 
additive white Gaussian noise (AWGN) vector with the covariance matrix cr 2 I nR . For doubly 
correlated MEMO channels, the channel matrix H can be written as [4], [5] 

H = *^ /2 H uc *^ /2 (4) 

where H uc ~ AT nn>riT (0, I nR , I nT ), and * T e C nTX " T > and * R e C nRXnR > are the 
transmit and receive correlation matrices, respectively. Note that H ~ J\ nK ,m (0, ^r, ^t) and 
it has been used in various attempts for studying correlated MIMO channels [4]-[14]. Recently, 
this model has also been validated through physical measurements [17]. 
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In what follows, we refer to n s = min {n T , n R } and n L = max{n T ,nR} and define the 
random matrix © £ c™sxn s > q as 



HH f , if n R < n T 
H+H, otherwise. 



Also, let us denote, for convenience, 

(*S,*l) = ( ( * R '* t) ' if ^^^ 

I (\I/ T , Vl> R ) , otherwise 

and let < As,i < As, 2 < • • • < As, ns and < Al,i < Xl,2 < ■ ■ ■ < ^L,n L be distinct 
ordered eigenvalues of SE^ and \I>l, respectively. Then, ~ Q ns ,n L (In L , ^s, ^l) for the doubly 
correlated MIMO channel. 

In general, when the receiver has perfect channel knowledge, the input distribution that 
maximizes the mutual information between x and y is circularly symmetric complex Gaussian 
for any given input covariance Q. When the transmitter has no channel knowledge, power among 
transmit antennas cannot be allocated in accordance with the realization of H to maximize the 
mutual information, and hence equal power allocation to each of transmit antennas is the most 
reasonable strategy, i.e., choosing Q = (V/ni)I nT . 3 This yields the capacity in nats/s/Hz as 
[2], [3] 

C = In det (I ns + (r]/n T ) 0) (5) 

where 77 = Vja\ is the average SNR at each receive antenna. Since the channel matrix H 
is random, the associated channel capacity C is also a random variable whose statistics are 
determined by the statistical properties of the eigenvalues of ~ Q ns ,n L (In L , 

III. Capacity Statistics 

In this section, we will investigate the statistical properties of the capacity random variable C 
in © for doubly correlated MIMO channels. We begin by deriving the CF of C, from which 
all other functions, such as the PDF, cumulative distribution function (CDF), and cumulant 
generating function (CGF), and statistical moments of C can be obtained. 

3 It has been shown in [3] that if the channel has i.i.d. Rayleigh fading between antenna pairs, the optimum input covariance 
matrix is Q = (V/n,T)In T - 
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A. Characteristic Function 

Theorem 1: Let H ~ A^ nR , nT (0, * R , * T ), i.e., ~ Q nSi „ L (I„ L , * s , * L ). Then, the CF of 
the capacity C in nats/s/Hz is 

= K-lT ns (ju) detAW (6) 

where j = v / — T and 

/ \ ns(ns-l)/2 

ns-l 

and A (juj) is the n L x n L matrix whose (i, j)th entry is given in Table 0] 

Proof: See Appendix B. □ 

Note that Theorem [l] requires correlation matrices S&t and ^ R to have distinct eigenvalues. 
The case when the correlation matrices have non-distinct eigenvalues (some of Ag/s or Al/s 
are equal), we can obtain the CF as a limiting case of © [7], [23]. In particular, when \I> T = I„ T 
and * R = I„ R (i.i.d. case), <$> c M is given by [7, eq. (25)] 

$cM = K£detn(ju) (9) 

where Km = YYeli ( n L — — 1)\ and Q {juj) is the n$ x ns Hankel matrix whose (i,j)th 
entry is given in Table [Q Theorem [l] generalizes the previous results for i.i.d. and one-sided 
correlated channels (which are special cases of non-distinct eigenvalues) [7]-[9] to the doubly 
correlated MIMO channel given by ©. Using the analytical formulas for the CF in © and ©, 
the PDF and CDF of C can be expressed in forms of the inverse Fourier transform of $ c (jw), 
which can be efficiently calculated by using the fast Fourier transform (FFT) method [7], [24]. 

B. Mean, Variance, and Higher-Order Statistics 

From the CF of C in © and © involving the determinants, we derive the exact closed-form 
expressions for the mean, variance, and other HOS such as cumulants, skewness, and (excess) 
kurtosis of the capacity. To do this, we first introduce the logarithmic derivative of a determinant. 
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1 ) Logarithmic Derivatives of a Determinant: Let R (t) be a matrix depending on a parameter 
t. If each entry of R (t) is differentiable with respect to t, then so is det R (t) because the 
determinant is a polynomial in the entries of R (t). If R (t) is invertible, the first-order logarithmic 
derivative of det R (t) is given by [25] 

(ilndetRft) , -, , , . , , 

^— = tr {R- 1 (t) R« (t) } . (10) 

We now generalize (fTTjl) to the arbitrary order of differentiation. 

Definition 2 (Polymatrix and Dimatrix): Let R (£) be an invertible matrix whose elements are 
differentiable with respect to t. Then, the nth polymatrix of R (i) with respect to t is defined as 

R [n] (t) 4 R 1 (t) RW (t) . (11) 

In particular, we call Rpj (t) the dimatrix of R (t) . 

Lemma 1: The polymatrices and the derivatives of the dimatrix of R(£) have the following 
relationship 

r h (*) = E ( / 1 1) R [^i (*) R m _1) (*) W 

and the Mi logarithmic derivative of det R (£) is the trace of the (£ — l)th derivative of the 
dimatrix Rni (i), i.e., 

^y' ^K"'")' <i3) 

Proof: By definition, we have 

^ „ i , x rf n_1 R (1) (*) 

R N (t) = R (t) — t 

n 



£=1 v 7 

=e(":J)»m<«) «&-"(*) 



where (a) follows from the Leibniz's identity [26, p. 21] and (b) follows from (fTTT) . Also, (fl~3l 
follows immediately from (fT0"l) . (fTTT) . and by interchanging the order of differentiation and trace 
operators. □ 



8 



IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. XX, NO. X, MONTH 2005 



This lemma says that the ith logarithmic derivative of the determinant of a matrix can be 

determined by its first i polymatrices. For example, the second, third, and fourth order logarithmic 

derivatives of det R (t) are given by 

G? 2 lndetR(t) , , . ~ . , -, 

^— ^2 = tr {R [2] (t) - RJ, (t)} (14) 

d3ln ^3 R(t) = tr {2Rf 1] (t) - 3R W (t) R [2] (t) + R [3] (t)} (15) 

<i 4 lndetR(t) r , , ^„^o , N , s 
^— ^1 = tr {-6RJ] (t) + 12RJ] (*) R [2] (t) 

-3Rf 2] (t) - 4R [1] (t) R [3] (t) + R [4 ] (*)} • (16) 

Using the explicit determinantal CF's in Section UlI-AI and Lemma d we now derive statistical 
moments of C, which requires determining the polymatrices 

n M (1/) = n- 1 (z/) (1/) , n [0] (1/) = ins ( 17 ) 

A w (u) = A- 1 (u) AW (i/) , A [0] 0) = I„ L (18) 

for i.i.d. and doubly correlated MIMO channels, respectively, where the (i, j)th entries of (z/) 
and A' n ' (1/) for n G N are given in Table 0] 

2) Cumulants: The nth cumulant of C is by definition expressed as 

(19) 



i/=0 



where K-c iy) — In $c* (^) is the CGF of C. Note that the first and second cumulants are the 
mean and variance of the capacity, respectively. 

Theorem 2: Let H ~ A/" na ,n T (0,I„ R ,I„ T ), i.e., ~ W„ s (n L ,I„ s ) (n R x n T i.i.d. MIMO 
channel). Then, the nth cumulant of the capacity C in nats/s/Hz is 

« n = tr{nf;- 1) (0)}. (20) 
Proof: It follows immediately from ©, (fl9l) . and Lemma [T] □ 

Theorem 3: Let H ~ A/"„ R , nT (0, * R , * T ), i.e., ~ <2„ S ,„ L (I« L , *s, *l) ("k x ™t doubly 
correlated MEMO channel). Then, the nth cumulant of the capacity C in nats/s/Hz is 

ng — 1 

Kn = tr {a^ (0)} + (-1)" (n - 1)! ]T £~ n+1 . (21) 
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A 

m n = 



Proof: It follows immediately from Theorem [H (fl9l) . and Lemma [T] □ 

Using the relationship (IT2T) . the matrices (z/) and A&| (z/) in Theorems |2] and |3] can 

be determined by the polymatrices in (fTTl) and (fTSt . respectively. 

3) Raw and Central Moments: The nth raw and central moments (i.e., moments about the 
origin and the mean, respectively) of C can be obtained from the general relationships between 
moments and cumulants [27]: 

E {C n } = Z J) m ^ Ki {22) 

u„ = E {(C - mi)"} = y^() m n _ e (-mi)' (23) 

where m„ and fi n are the nth raw and central moments of C, respectively. In particular, the 
ergodic capacity is given by m 1 = E {C}. 

4) Skewness and Kurtosis: The skewness characterizes the degree of asymmetry of a distri- 
bution around its mean and the kurtosis measures the peakedness or flatness of a distribution 
relative to a Gaussian distribution. The skewness and (excess) kurtosis of C, denoted by p 1 and 

respectively, can be obtained from the cumulants in Theorems |2] and |3] for i.i.d. and doubly 
correlated cases as 

/3 2 ^-3 = % (25) 

Using ([T4l)-([T6l). d20b . (I2T1) . (l24b . and d25l) . the trace representations for the mean, variance, 
skewness, and kurtosis of the capacity are tabulated in Tables |n| and |in| for i.i.d. and doubly 
correlated MIMO channels, respectively. 4 Since the skewness and kurtosis of a Gaussian distri- 
bution are equal to zero, nonzero values of these quantities indicate the degree of deviation from 
the Gaussian distribution. 5 

4 Alternative expressions for the ergodic capacity and capacity variance of the i.i.d. case, in terms of integrals involving the 
Laguerre polynomials, can be found in [3] and [16], respectively. 

5 The skewness and kurtosis of the capacity have been used in conjunction with the Gram-Charlier expansion to estimate the 
Kullback-Leiblier divergence, as a measure of non-Gaussianity, between the capacity distribution and its Gaussian approximation 
[24]. 
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IV. Effect of Correlation at High SNR 

The effect of fading correlation on the behavior of capacity is not immediately apparent from 
the exact analytical expressions such as the CF in © and the moments in Table ITjTI Therefore, 
we resort to the asymptotic analysis to investigate such an effect in the following. In particular, 
we consider a high-SNR regime since the benefits of the use of multiple antennas are more 
pronounced at high SNR. In this case, the capacity © can be written as 

C = n s In fa/nx) + In det (0) + O (I/77) , (26) 

which reveals that at high SNR, the capacity is characterized by the logarithmic generalized 
variance In det (0) of ~ Qn s ,n L (In L > *s> Note in d2oT) that min {n T ,n R } (more pre- 
cisely, the rank of H) determines the spatial multiplexing gain of a MEMO channel, while the 
logarithmic generalized variance In det (0) determines the diversity gain in capacity point of 
view. Consequently, if the correlation matrices have full rank, then the channel matrix H in © 
has full rank with probability one. In this case, antenna correlation does not diminish the spatial 
multiplexing gain and only decreases the diversity gain [6]. 

Starting with d2oT) and using similar steps in the proof of Theorem [H the CF of the capacity 
in the high-SNR regime can be written as 

n z3 i n ~ det ( s ) dzidz2 ■ ' ■ dz ^ 

_<*l<-<z„ s <oo^ =1 l<i<j<n s 

= A |n r ^ + £ )} det KM (27) 

where 

A = fr/nxr* det (gsT r2R , 

UZi r (i) Ui<i<j<^ (Aw - a m ) 1 

and K (jju) is the 71l x matrix whose (2, j)fh entry is given by 

[ X lT *> 1 = n h - n s + l } . . . } n L} j = I, . . . } n L . 
Also, from Lemma [H (fl9l) . and (l27b . the nth cumulant of the capacity in nats/s/Hz at high SNR 
becomes 



I Tj high 



tr{K| 1 r ]- 1) (0)}+5 ln - nslnf J +lndet(* s ) + X> (n_1) W (30) 
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where 8y — { ' \ J- is the Kronecker delta and ijj^ (z) = d n+1 In T (z) /dz n+1 is the polygamma 
function. 6 In particular, if n R = tit, (l30l) reduces to 



l ^" a I r] high *^ lr 



J + In det (* t *r) + J] W • < 31 ) 



From (QUI and dTTT ). we have the following observations. 

• At high SNR, the mean capacity decreases by the amount of In det (\^s) due to correlation 
at the side with the smaller number of antennas and by the amount of tr |K[ 1 ](0)| due to 
correlation at the side with the larger number of antennas. 

• The variance, skewness, kurtosis, and other HOS of the capacity depend only on correlation 
at the side with the larger number of antennas and they converge to finite quantities 
determined by d2"2l -(l23t and ( BUI) , as r\ — > oo. 

• When n R = n T , fading correlation at any side does not affect the variance and HOS in 
the high-SNR regime, while the mean capacity decreases by the amount of In det (v&t^r)- 
Moreover, it follows from d24l . d25t . and (TJTT) that 

12V6-C(3) 



7T 3 



<3 1 \ .. . < and < B 2 \ .. . < 2.4 

— ' \r) high ' \r] high — 



which imply that the capacity distribution has an asymmetric tail extending out more to the 
left of its mean and is leptokurtic (i.e., more peaked than a Gaussian distribution) in the 
high-SNR regime. 

• For a single-input single-output (SISO) case (n T = n R = 1), we have 

mil . = In (rf) — 7, u 2 | K . h = — , K . h = — — , fh\ u- u = 2.4 

I »7 high v ' ' 11 " 1 1] high g > ' I r\ high ^3 ' ' \r\ high 

which reveal that — C at high SNR follows the extreme value distribution [27]. 



6 For 2 6 N, the digamma function (2), trigamma function ip^ [z), tetragamma function tp^ (2), and pentagamma 
function ip^ (2) can be expressed as 

^ (0) (*) = -7+X;i, ^ W = ^-g^, V (2) (-) = -2C(3) + g ; | J ^> W = g-g^ 

n — 1 n— 1 n — 1 n— 1 

where 7 « 0.5772156649 is the Euler-Mascheroni constant and £ (3) « 1.2020569 is Apery's constant. 
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V. Numerical and Simulation Results 
To illustrate our analytical results, we consider the exponential correlation model 

^ xp) (p)=[p li -%, j=1>2 ,..., n , pe[0,i), 

as well as the multiple element transmit receive antennas (METRA) model [17] in our numerical 
examples. The former model is reasonable in the case of the equally-spaced linear array. The 
latter model characterizes the correlation properties of MIMO channels using a reduced set 
of physical parameters such as antenna spacing, power angular spectrum, azimuth spread, and 
angle of arrival. This model was validated based on measured data collected in both picocell and 
microcell environments [17], and also has been proposed recently for mobile broadband wireless 
access (MBWA) MIMO channels [18]. 

A. Exponential Correlation Model 

In all examples for exponential correlation, we set \£t = $nT P ' (pt) an d = $1r P ' (Pr)- 
Fig. d shows the PDF of C for i.i.d. and exponentially correlated (px = 0.5, pr = 0.7) MIMO 
channels at r] = 15 dB when = n R = 3. The analytical curves are plotted by using ©, ©, 
and [24, eq. (4.33)]. We also compare our analytical results with the simulated PDF obtained by 
generating 100 000 realizations of H. It can be seen that analytical and simulated curves match 
exactly. The figure also shows that the mass of the PDF is mostly above a certain level due to 
the spatial multiplexing gain (for example, 4 nats/s/Hz for the exponentially correlated case and 
5 nats/s/Hz for the i.i.d. case). 

Fig. El shows the CDF of C for exponentially correlated MIMO channels with px = 0.5 and 
Pr = 0.7 at 7] = 15 dB when n<? = = 2, 3, 4, and 5. The analytical curves are plotted 
by using © and [24, eq. (4.34)], and they agree exactly with the simulated ones. It can be 
seen that the capacity increases linearly with the number of antennas for the entire range of 
cumulative probability, despite the presence of correlation. This can be attributed to the spatial 
multiplexing gain achieved by increasing the number of antennas at both sides. For example, 
the capacity at the cumulative probability of 0.1 (i.e., 10% outage capacity) is about 3.76, 5.95, 
8.12, and 10.30 nats/s/Hz for n T = n R = 2, 3, 4, and 5, respectively; we can gain approximately 
2.18 nats/s/Hz of additional capacity for each increase in the number of antennas at both the 
transmitter and the receiver. Also, the spatial multiplexing gain of MIMO systems guarantees 
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a certain transmission rate at arbitrarily low outage probability (e.g., 2.00, 4.18, 6.36, and 8.54 
nats/s/Hz for wt = wr = 2, 3, 4, and 5, respectively). Fig. |3] illustrates the effect of exponential 
fading correlation on the capacity distribution for the case of tit = tir = 3 and i] = 15 dB, where 
Pt and p R range from to 0.9. It can be seen that the decrease in capacity due to exponential 
correlation is negligible for a small amount of correlation, but it becomes more significant as the 
correlation coefficient increases. Moreover, the capacity reduction is more pronounced at high 
cumulative (or outage) probability. 

Fig. E| shows the analytical and simulated mean, variance, skewness, and kurtosis of C for 
exponentially correlated MIMO channels as a function of correlation coefficient p for p T = 
Pr = p, n T = n R = 3, and 77 = 15 dB. Again, our analytical results are in excellent agreement 
with Monte Carlo simulations that are carried out by generating 100 000 realizations of H. Fig. 
|5] shows the mean, variance, skewness, and kurtosis of C versus SNR for i.i.d. and doubly 
correlated (p T = 0.5, p R = 0.7) MIMO channels when n T = n R = 2. It can be seen that as the 
SNR increases, the variance, skewness, and kurtosis for both i.i.d. and doubly correlated cases 
converge to 2.290, —0.810, and 1.333 according to j24b . d25t . and d3TT) . respectively, and the 
effect of fading correlation on these statistics diminishes. 

B. METRA Correlation Model 

We now consider 4x4 MIMO channels with correlation matrices obtained by the METRA 
model [17], [18]. In all examples, it can be observed that our analytical results agree exactly with 
Monte Carlo simulations. Fig. |6] shows the ergodic capacity for METRA correlation matrices, 
given in [17, p. 82], for picocell and microcell environments. The picocell example is a partially 
decorrelated scenario selected from a small office environment, whereas the microcell case 
corresponds to an environment where the receiver (base station) is highly correlated (see [17] 
for details on the antenna configurations and environment setups). We can see from Fig. |6] that 
the ergodic capacity at 77 = 15 dB is 11.25, 9.44, and 6.22 nats/s/Hz for i.i.d., picocell, and 
microcell cases respectively. The reduction in ergodic capacity due to spatial correlation is about 
16% for picocell and 45% for microcell environments, respectively. Fig. |7] shows the CDF of 
the capacity at 77 = 15 dB in the same environments as in Fig. The 10% outage capacity 
is 9.82, 8.21, and 5.33 nats/s/Hz for i.i.d., picocell, and microcell scenarios respectively. The 
reduction in 10% outage capacity due to spatial correlation is about 16% for picocell and 46% 
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for microcell environments, which is similar to the amount of reduction in ergodic capacity. 

We next consider the METRA correlation matrices, given in [18], for 4x4 MIMO channels 
in macrocell Pedestrian A and Vehicular A environments of the international telecommunication 
union (ITU) standard. For these correlation matrices, the ergodic capacity is shown in Fig. [8] and 
the CDF of the capacity at r] = 15 dB is shown in Fig. |9l The ergodic capacity at 77 = 15 dB is 
6.64 nats/s/Hz for the ITU Pedestrian A and 7.11 nats/s/Hz for the ITU Vehicular A, respectively 
(see Fig. HJ). Also, 10% outage capacity is 5.67 and 6.01 nats/s/Hz for each environment (see 
Fig. IS- The reduction in ergodic capacity and 10% outage capacity due to spatial correlation is 
about 40% in both environments. 

VI. Conclusions 

In this paper, we derived closed-form formulas for the exact capacity statistics of Rayleigh- 
fading MIMO channels in the presence of spatial fading correlation at both the transmitter and the 
receiver. In particular, we derived the determinant representation for the characteristic function 
(Theorem [T]) and the trace representations for the cumulants (Theorems El and |3j of MIMO 
capacity as well as the mean, variance, skewness, and kurtosis (Tables ITTl and ITTTb . These results 
are valid for arbitrary numbers of antennas, enabling us to calculate both the ergodic capacity 
and the outage capacity without any approximation and generalizing the previous results for 
i.i.d. and one-sided correlated MIMO channels. We also showed that in a high-SNR regime, the 
variance, skewness, kurtosis, and other higher-order statistics of the capacity depend only on 
correlation at the side with the larger number of antennas. Moreover, when the antenna topology 
is symmetric (i.e., = tir), these statistics are not affected by fading correlation at any side 
and the capacity distribution has negative skewness greater than or equal to — 12\/6 • ( (3) /V 3 , 
where £(3) ~ 1.2020569 is Apery's constant, and positive (excess) kurtosis less than or equal 
to 2.4. This implies that the capacity distribution has an asymmetric tail extending out more to 
the left of the ergodic capacity and a leptokurtic shape more peaked than a Gaussian one. To 
illustrate our analytical results, we presented numerical examples using the correlation model 
based on realistic channel measurements as well as the classical exponential correlation model. 
These examples showed that our analytical results are in excellent agreement with Monte Carlo 
simulations and that a considerable decrease in capacity, due to spatial fading correlation, can 
be observed in realistic MIMO channels. 
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Appendix A: Integral Identities 



Let us define the integrals Q n (a, b, £) and J n ^ (a, 6, £) as 

POO 

^(0,6,0=/ (1 + ax)* -1 x^V^dx, a, 6 > 0, neN, ^ G C (32) 



Jn/ (o,6,0 = ™ n (°;> b '® = I (l + axf~ 1 \n i {l + ax)x n - l e- x ' b dx (33) 



which appear in deriving the analytical expressions for the capacity statistics in Section |nij 

From the integral representation of the confluent hypergeometric function \P (a, b; z) in [26, 
eq. (9.21 1.4)] and the identity 2 F (a, b; -z' 1 ) = z a ^> (a, a - b + 1; z), the integral Q n (a, b,g) 
can be evaluated as 

G n (a, b, = a~ n {n - 1)!* (n, n + £; ^ 

= 6 n (n-l)! 2 F (n,-£ + l;-a6) (34) 



where p F g (ai, 02, • • • , %>; 61, 62, • • • , -2) is the generalized hypergeometric function [26, eq. 
(9.14.1)]. In particular, for (GN, (l34l reduces to a finite sum of elementary functions as 

g n (a, 6, = & n E ~ X ) iflbf (n + k — 1)!. (35) 

Since the derivatives of the generalized hypergeometric function with respect to its parameters 
are not known, in general, the integral J n> e(a, b, £) cannot be evaluated directly from (P4"l) . 
However, £ n (a, 6, £) for a, b > 0, n G N and £ G C can be expressed in an alternate form 

p l/(o6) n_1 /„ _ 1 \ / 1 \ 

&( ,U) = — £ ^ (-in<r (H 5 (36) 
fc=0 v 7 v 7 

where T (a, z) = J z °° e~ x x a ~ 1 dx is the complementary incomplete gamma function [26, eq. 
(8.350.2)]. Then, using d36t and Leibniz's identity [26, p. 21], the integral J nt i{a,b,^) defined 



16 



IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. XX, NO. X, MONTH 2005 



in (l33l can be evaluated as 

e l/(a6) ^ 

J n/ (a, b,0 = — — ^ 

fc=0 



-1 



,n— k— 1 



n — 1 

A; 



i=0 



aby 

Qi 



da 1 



r ( a, — 



fc=0 . 



fc— 1 



n — 1 



e+i i's 



1,1,..., 1 

o,o,...,o,e + fc 

e+i o's 



(37) 



where G™f 
reduces to 



is the Meijer G-function [26, eq. (9.301)]. In particular, for t = £ = 1, 07]> 



J- n)1 (a, 6, 1) = 6* (n - lJleVW ]T (a&)~ fe T -i) . 

k=o \ a ' 



(38) 



Appendix B: Proof of Theorem Q] 

To proceed with the proof of Theorem [T] we begin by evaluating an integral involving matrix 
determinants, which is a continuous analogue of the well-known results in multivariate analysis 
[28]. The next lemma adds a new identity to the list of the generalized Cauchy-Binet formulas 
derived in [7, Appendix]. 

Lemma 2: Suppose that /, and gj, i = 1, 2, . . . , m, j = 1, 2, . . . , n, m < n, are arbitrary 
integrable functions over D. Let F (gi, g 2 , . . . , g m ) and G (qi, g 2 , ■ ■ ■ , Q m ) bemxm and n x n 
matrices whose entries depend on g x , g 2 , . . . , Q m , given by 

V? {Qi, 02, ■ ■ ■ , Qm)}^ = fj(Qi), hJ = 1,2,..., m (39) 

{Ci ,■ , % = 1 , . . . , n — m, 7 = 1 ..... n 

( 40) 
9j {Oi-n+m) , i = n - m + 1, . . . , n, j = 1, . . . , n 

where scalar constants. Then, 

/ • • • / det F (g 1} g 2 , . . . , Q m ) det G(g 1 ,g 2 ,..., g m ) h (g e ) dg x dg 2 ■ ■ ■ dg m = ml det (<&) 

(41) 



SHIN et at: ON THE CAPACITY OF DOUBLY CORRELATED MIMO CHANNELS 



17 



where h (■) is an arbitrary function and $ is the n x n matrix with (i, j)th entry 0^- given by 

{Cj,-, i — 1, . . . , n — m, j — 1, . . . , n 

(42) 
J s /i_„ +m (e) 9j (q) h (g) dg, i = n - m + 1, . . . , n, j = 1, . . . , n. 

Proof: Let a = (ai, a 2 , . . . , a„) and b = (&i, 6 2 , • • • , b m ) be the permutations of integers 

l,2,...,n and 1,2, ...,m, respectively. Then, the integration of the left-hand side of (I4TT) . 

denoted by X, becomes 



n—m m 



X = ^ sgn (b) sgn (a) JJ c i>ai JJ <j) n - m+b] , a „_ m+ . , (43) 

b a i=l j=l 

where sgn (•) denotes the sign of the permutation. Note that the sequence 1, 2, . . . , n — m, bi + 
n — m, 62 + n — m, . . . , b m + n — m is a permutation of 1, 2, . . . , n with the sign equal to sgn (b). 
Therefore, using [7, eq. (38)], we get 

X = ^ det ($) = ml det (*) . (44) 

b 

□ 

Proof of Theorem [7} The CF of C can be written as 

$c M = / det (I ns + 77©)^ p© (0) rf© (45) 

j©=©t>o 

where 77 = r//n T and from ©, the PDF of ~ Qn s ,n L (In L , *s, *l) is given by 

p & (0) = ?r ±— det (* s )" nL det (* L )- ns det (0)" F ( " l) (-* s 1 0, ^) . (46) 

The typical approach for the evaluation of the integral in (1431 is to perform eigenvalue decompo- 
sition using the unitary transformation of and to exploit the knowledge of the joint eigenvalue 
distribution of 0. However, the correlation matrix in the argument of the hypergeometric 
function in (|4"6T) prevents the removal of the unitary matrix from its arguments after the eigenvalue 
decomposition, which makes it difficult to directly use the joint eigenvalue distribution of 0. 
We alleviate this difficulty by performing two successive transformations as follows. The first 
transformation is given by Z = ^ s 1 ^ 2 0^' s with Jacobian d& = det (\E r s)" s dZ. Using the 
fact that 

F^ (-* s 1 0, * L X ) = F (nL) (-*s 1/2 0*s 1/2 , ^l 1 ) , (47) 
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we have 

Het fvi/ T ^r ns r 

®c M = ~ l . L \ / det (I ns + r^sZ) 

J- n s [ML) ^Z=Zt>0 

• det (Z)" L - ns F (nL) (-Z, ^l 1 ) <*Z. (48) 

Let us denote a unitary manifold of ns x ns unitary matrices with real diagonal elements 
by U (ns). Since Z is Hermitian, there exists U G U (ns) such that Z = UDU^ and D = 
diag (zi, z 2 , ■ ■ ■ , z ns ) where < Z\ < z 2 < ■ ■ ■ < z ns are ordered eigenvalues of Z. We then 
make the second transformation Z = UDU^ with Jacobian dZ = rii<i<j<ns ( z i ~ z ^ d^JdD 
[29, Theorem 3.1], [30, Theorem 4.4], yielding 

$c W = ^t^^ [ [ det (I ns + ^sUDUt)*" det (D)" 

J- n s (WlJ Jr> JU£U(n s ) 

■ 11 (z j -z i )\F^ ) (--D,^Z 1 )dUdD (49) 

l<i<j<ng 

where we have used the fact that the hypergeometric function with matrix arguments is invariant 
under unitary transformations of its arguments. 7 It is now apparent that the above two transfor- 
mations enable us to remove the dependence of U on the hypergeometric function. Recall that 
the total volume of U (n s ) is [30, Corollary 4.3.1] 

n s (n s -l) 

dU = -= . (50) 

ueu(n 3 ) T ng (ns) 

We can now carry out the integration with respect to U using [30, eq. (6.1.19)] and [21, eq. 
(52)] as 

r 7J -ng(ns — 1) _ 

/ det (I ns + Tj^sUDUt)*" dV = iF (ws) {-w D, -fj* s ) ■ (51) 

Substituting (EB into (03) gives 

7r «s(«s- 1 )det(* L )" ns r 



t n s (™Lj 1 n s (™Sj J0<Zi<-<Zn s <oo £=1 l<i<j<n s 



■ J™ (-ju; D, -77* s ) o^ (nL) (-D, dz 1 dz 2 ■ ■ ■ dz ns . (52) 



7 Note that Ili<i<j<n S ( Z J — JZi ) * s me ns x ns Vandermonde determinant of Zi,Z2, 



l<i<j <ng 
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Using the results in [23, Lemma 3] and [20], the hypergeometric functions with matrix arguments 
in the integrand of (l52l can be expressed in terms of determinants as 



K„ 5) ,_ - _^ , = v -W)-^- 1 " 2 f M det [(! + DAs^r*--'] „ s 

! 11 " n2 1 (-^-«s + i),_ 1 ni< i <,<„,h-^)(A SJ -A s 



<i<j<n s 

(53) 



-n s (n s -l)/2 rws ( Wl ) det (^ L )ns det gj 



F ( " l) (-D, = — -"s^w^m-w ^ (54) 



where (a) n = a (a + 1) • • • (a + n — 1), (a) = 1, is the Pochhammer symbol and S is the 
n L x n L matrix whose (i,j)th entry is given by 

Ar 1 , i = l,...,n L -n s , j = l,...,n L 

{S} *H , / \ (55) 

I Alj " s exp r L L j" S j ' i = n L-n s + l,...,n L , j = l,...,n L . 

Combining (EJl-lEl) together with 

n s ns—1 



\n s (ns-l)/2 



il (-jw - n s + l) t _ x = 11 (jtu + if , (56) 



we get 



$>c M = %^ / det [(1 + fjXsjZiY"**- 1 ] _ g det (3) dz, ■ ■ ■ dz ns 

-^cor Jo<Zi<---<z ns <oo 



= r ■ ■ ■ r det K 1 + ^ z ^ +ns '%=i,2,...^ ^ • • • (57) 

"-S'-^cor JO Jo 

where the last equality follows from the fact that the integrand is symmetric in z±, z%, . . . , z ns . 
Finally, applying the integral-type Cauchy-Binet formula in Lemma |2] to (BTl) and using the 
identity (E3> complete the proof. 
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TABLE I 

Some matrices involved in the analytical expressions for the capacity statistics (see Appendix A for 

DETAILS ON THE EVALUATION OF INTEGRALS). DENOTE fj = T)/m. 



Notation 


Dimension 


(i, j)th entry (where i — i — iil + ns and n G N) 


AM 


TIL X IlL 


{A (")} i= i j2l ... lTlL _ nS] j = i J 2,...,n L = ^L,j 

{A ( I/ )}i = nL - ns +l, nL _ ns -|_2,...,n L ,j = l,2,...,n L 
f oc 

= A Ly ns_1 y (i+ ^a s ,,«)" + " s " 1 e - z/AL ^d Z 

= A^"" s 2 F () (1, -i/ - n s + 1; »/.WA,.. , 


A (n) (v) 


nL X IlL 


{a m (;/)) =0 

I J i=l,2 nL-ns,3=l,2,.".ni 

{a"" H) 

k J i=n L — n s +l,n L — n s +2,...,n L ,j=l,2,...,n L 

/• oo 

= A^-" 8 " 1 / (1 + i/As,,*)"" 1 "" 8-1 In" (1 + jjAs,,*) e-'^dz 
Jo 

= n!(^ s ,0^" s - 1 A L 7- 1 exp(^-) 

w( ^n + 2,0 f 1 | 1,1,. ..,1 A 
x<J n+l,n+2 ^ *7A Sil A L>3 0,0, . . . ,0,u + n s J 


Sl{v) 


ns x ns 


{nM} w =i, a ,...,„ s 

= / (l+^) l ' Z nL - ns+i+ ^ 2 e- Z d Z 

Jo 

= (tz l - n s + i + j - 2)! 2 F (n L - ns + i + j - 1, -z/; -77) 


f2 (n) {y) 


ns x ns 


{n (n) (")} , 

/* CO 

= / (l+^) v ln n (l + f^)* nL - ns+<+, '-V z <fc 
io 

1 I/77 n L ~n s +i+j-2 " 
n - e / ^yi-L-ns+i+j-k-2 
fjHL-«S+'+]-l -^-^ ^ ' 
' fc=0 

/„ L -n s +t+j-2\ - I /+fc+l r .n+2,0 /l 1,1,. ..,1 \ 
X l fc J 7 ? l - r n + l,n+2 ^ 0,0,...,0,^ + fc + iy 
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TABLE II 

Mean, variance, skewness, and kurtosis of the capacity in nats/s/Hz for i.i.d. MIMO channels, 
H ~ 7Vn R ,„ T (0, I ). Denote ft n = Sl [n] (0). 



mi (mean) 


tr (fii) 




fi 2 (variance) 


tr (n 2 - n?) 




Pi (skewness) 


tr (217? - 3fiin 2 + n 3 


) 


[tr(fi 2 -fi?)] 3/2 




P2 (kurtosis) 


tr (-6nf + 12 f2 2 fl 2 - 


3fi 2 -4f2in 3 + f2 4 ) 


[tr(n 2 - 


-«l)] 2 



TABLE III 

Mean, variance, skewness, and kurtosis of the capacity in nats/s/Hz for doubly correlated MIMO 

CHANNELS, H ~ A/"„ R ,„ T (0, *r , * t ) • DENOTE A n = A [n] (0). 



mi (mean) 


tr (Ai) - (n s - 1) 


112 (variance) 


n s -l 

tr (A 2 — A?) + Y, r ' 

t=i 


Pi (skewness) 


tr (2A? - 3Ai A 2 + A 3 ) - E^f" 1 2T a 


[tr(A 2 -A?) + E2rV-f /2 


p2 (kurtosis) 


tr (-6A? + 12A 2 A 2 - 3A| - 4AiA 3 + A 4 ) + E^" 1 M~ 3 


[tr(A 2 -A?)+E?fr 1 ^ 1 ] 2 
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Fig. 1. PDF of the capacity for i.i.d. and exponentially correlated (pr = 0.5, pn = 0.7) MIMO channels, tit — ur = 3 and 
r) = 15 dB. 



SHIN et at: ON THE CAPACITY OF DOUBLY CORRELATED MIMO CHANNELS 



25 




Capacity (nats/s/Hz) 



Fig. 2. CDF of the capacity for exponentially correlated MIMO channels with pt = 0.5 and pr = 0.7. tit = ur = 2, 3, 4, 5 
and r] = 15 dB. 
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Fig. 3. CDF of the capacity for exponentially correlated MIMO channels with pt = Ph. = (i.i.d.), 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 
0.7, 0.8, and 0.9. n T = n R = 3 and r) = 15 dB. 
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Fig. 4. Mean, variance, skewness, and kurtosis of the capacity (nats/s/Hz) as a function of correlation coefficient p for 
exponentially correlated MIMO channels with p T = Ph. = p- «t = hr = 3 and ij = 15 dB. 
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Fig. 5. Mean, variance, skewness, and kurtosis of the capacity (nats/s/Hz) versus SNR for i.i.d. and doubly correlated (pt = 0.5, 
Pr = 0.7) MIMO channels. n T = n R = 2. 
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Fig. 6. Ergodic capacity of doubly correlated MIMO channels with tit = ur = 4 and METRA correlation matrices given in 
[17, p. 82] for picocell and microcell environments. For comparison, the ergodic capacity of the 4x4 i.i.d. MIMO channel is 
also plotted. 
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Fig. 7. CDF of the capacity for doubly correlated MIMO channels with «t = tir = 4 and METRA correlation matrices 
given in [17, p. 82] for picocell and microcell environments. For comparison, the CDF of capacity for the 4x4 i.i.d. MIMO 
channel is also plotted. 77 = 15 dB. 
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Fig. 8. Ergodic capacity of doubly correlated MIMO channels with tit = ur = 4 and METRA correlation matrices given 
in [18] for macrocell ITU Pedestrian A and Vehicular A environments. For comparison, the ergodic capacity of the 4x4 i.i.d. 
MIMO channel is also plotted. 
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Fig. 9. CDF of the capacity for doubly correlated MIMO channels with m = tir . = 4 and METRA correlation matrices 
given in [18] for macrocell ITU Pedestrian A and Vehicular A environments. For comparison, the CDF of capacity for the 4 x 4 
i.i.d. MIMO channel is also plotted, r/ = 15 dB. 



